Segmental feature extraction and coding for speech synthesis
نویسندگان
چکیده
This paper describes a segmental feature extraction and speech coding method in an acousticarticulatory domain using nomograms that represent a mapping between formant frequencies and articulatory parameters. The vocal tract model is a modified Fant model, in which we newly introduced a parameter for successively adjusting vocal tract lengths. We investigated first the relationship between formant contours and those of articulatory parameters and found the effectiveness of the articulatory domain for organizing acoustic-phonetic features with little dependency upon languages. Next, we applied the method to the low bit rate coder and confirmed that good quality speech synthesis was achieved in the condition of 18 bit used for articulatory code words.
منابع مشابه
Segmental Featurs Extraction and Coding for Speech Synthesis
This paper describes a segmental feature extraction and speech coding method in an acousticarticulatory domain using nomograms that represent a mapping between formant frequencies and articulatory parameters. The vocal tract model is a modified Fant model, in which we newly introduced a parameter for successively adjusting vocal tract lengths. We investigated first the relationship between form...
متن کاملFeature extraction by auditory modeling for unit selection in concatenative speech synthesis
A comprehensive computational model of the human auditory peripherals was applied to extract basic features of speech sounds. The auditory model extracts features by the auditory temporal coding mechanism in addition to features by the auditory place coding mechanism which has traditionally been used as spectral features. It also considers the nonlinearity of human auditory responses. Several s...
متن کاملMFCC and its applications in speaker recognition
Speech processing is emerged as one of the important application area of digital signal processing. Various fields for research in speech processing are speech recognition, speaker recognition, speech synthesis, speech coding etc. The objective of automatic speaker recognition is to extract, characterize and recognize the information about speaker identity. Feature extraction is the first step ...
متن کاملFeature extraction in opinion mining through Persian reviews
Opinion mining deals with an analysis of user reviews for extracting their opinions, sentiments and demands in a specific area, which can play an important role in making major decisions in such area. In general, opinion mining extracts user reviews at three levels of document, sentence and feature. Opinion mining at the feature level is taken into consideration more than the other two levels d...
متن کاملA novel feature extraction strategy for multi-stream robust emotion identification
We investigate an effective feature extraction front-end for speech emotion recognition, which performs well in clean and noisy conditions. First, we explore the use of perceptual minimum variance distortionless response (PMVDR). These features, originally proposed for accent/dialect and language identification (LID), can better approximate the perceptual scales and are less sensitive to noise ...
متن کامل